SQUARE: A Benchmark for Research on Computing Crowd Consensus
نویسندگان
چکیده
While many statistical consensus methods now exist, relatively little comparative benchmarking and integration of techniques has made it increasingly difficult to determine the current state-of-the-art, to evaluate the relative benefit of new methods, to understand where specific problems merit greater attention, and to measure field progress over time. To make such comparative evaluation easier for everyone, we present SQUARE, an open source shared task framework including benchmark datasets, defined tasks, standard metrics, and reference implementations with empirical results for several popular methods. In addition to measuring performance on a variety of public, real crowd datasets, the benchmark also varies supervision and noise by manipulating training size and labeling error. We envision SQUARE as dynamic and continually evolving, with new datasets and reference implementations being added according to community needs and interest. We invite community contributions and participation.
منابع مشابه
Actively Estimating Crowd Annotation Consensus
The rapid growth of storage capacity and processing power has caused machine learning applications to increasingly rely on using immense amounts of labeled data. It has become more important than ever to have fast and inexpensive ways to annotate vast amounts of data. With the emergence of crowdsourcing services, the research direction has gravitated toward putting the wisdom of crowds to bette...
متن کاملSQUARE: Benchmarking Crowd Consensus at MediaEval
We extend the square benchmark for statistical consensus methods to include additional evaluation on two datasets from the MediaEval 2013 Crowdsourcing in Multimedia shared task. In addition to reporting shared task results, we also analyze qualitatively and quantitatively performance of consensus algorithms under varying supervision. 1. ALGORITHMS We extend square [5], a benchmark for evaluati...
متن کاملEntropy-based Consensus for Distributed Data Clustering
The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...
متن کاملAudioSentibank: Large-scale Semantic Ontology of Acoustic Concepts for Audio Content Analysis
Audio carries substantial information about the content of our surroundings. The content has been explored at the semantic level using acoustic concepts, but rarely on concept pairs such as happy crowd and angry crowd. Concept pairs convey unique information and complement other audio and multimedia applications. Hence, in this work we explored for the first time the classification’s performanc...
متن کاملPerformance Analysis of a Flash-Crowd Management System
Flash-crowds are a growing obstacle to the further expansion of the Internet. One of the solutions to this problem is to replicate the most popular documents to different web servers and to redirect client requests to these replicas. In this thesis we present a performance analysis of a flash-crowd management system based on RaDaR. We adjust the architecture of RaDaR to focus more on adaptabili...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013